The elusive source of quantum effectiveness 
We discuss two qualities of quantum systems: various correlations existing between their subsystems
and the distinguishability of different quantum states. This is then applied to analysing quantum
information processing. While quantum correlations, or entanglement, are clearly of paramount
importance for efficient pure state manipulations, mixed states present a much richer arena and re- 
veal a more subtle interplay between correlations and distinguishability. The current work explores 
a number of issues related to identifying the important ingredients needed for quantum information
processing. We discuss the Deutsch-Jozsa algorithm, the Shor algorithm, the Grover algorithm
and the power of a single qubit class of algorithms. One section is dedicated to cluster states 
where entanglement is crucial, but its precise role is highly counter-intuitive. Here we see that 
distinguishability becomes a more useful concept. 
I. INTRODUCTION: CLASSICAL AND
QUANTUM CORRELATIONS 

Correlations are ubiquitous in nature. The future 
observations we make of a system under study will in 
general be dependent on our past observations and the 
knowledge we have extracted based on them. Although 
we do not generally understand why events we observe 
around us are correlated in the first place, correlations 
themselves are very simply quantified within the framework
of Shannon's information theory p|. Suppose we
perform measurements on a given system (or set of systems)
repeatedly at different instants of time, $t_1, t_2, \ldots, t_N$.
Let us record the outcomes of our observations as a
sequence $x_1, x_2, \ldots, x_N$. Different sequences of outcomes
will naturally have different probabilities associated with
them, which we will denote $p(x_1, x_2, \ldots, x_N)$. Correlations
now mean that this probability will most generally
not be expressible as a product of probabilities of subsequences,
$p(x_1, \ldots, x_n) \times p(x_{n+1}, \ldots, x_N)$, for any $1 \le n < N$.
Shannon introduced the notion of mutual information 
in order to quantify how correlated different observa- 
tions are. For simplicity, if we divide measurements 
into two groups, A and B, each of them having a well 
defined probability distribution, p(A) and p(B) respec- 
tively, as well as a joint probability distribution, p(A, B), 
then the mutual information between A and B is defined
as $I(A:B) = H(A) + H(B) - H(A,B)$. Here
$H(X) = -\sum_{x \in X} p(x) \log p(x)$ is the well-known Shannon
entropy. There is a certain degree of subtlety in
trying to extend Shannon's mutual information to more 
than two different sets of outcomes, but this issue will 
not concern us in the current exposition. 
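
As a small illustration of the definition above, the following sketch evaluates $I(A:B)$ for two hand-picked joint distributions. The function names and the example tables are assumptions made for illustration only; entropies are measured in bits.

```python
from math import log2

def shannon_entropy(probs):
    """Shannon entropy in bits; zero-probability outcomes contribute nothing."""
    return -sum(p * log2(p) for p in probs if p > 0)

def mutual_information(joint):
    """I(A:B) = H(A) + H(B) - H(A,B) for a joint table joint[a][b]."""
    p_a = [sum(row) for row in joint]          # marginal distribution of A
    p_b = [sum(col) for col in zip(*joint)]    # marginal distribution of B
    p_ab = [p for row in joint for p in row]   # flattened joint distribution
    return shannon_entropy(p_a) + shannon_entropy(p_b) - shannon_entropy(p_ab)

# Hypothetical examples: two perfectly correlated bits, then two independent bits.
correlated = [[0.5, 0.0], [0.0, 0.5]]
independent = [[0.5 * 0.3, 0.5 * 0.7], [0.5 * 0.3, 0.5 * 0.7]]
i_correlated = mutual_information(correlated)
i_independent = mutual_information(independent)
```

The perfectly correlated pair carries one bit of mutual information, while the product distribution carries none, which is exactly the sense in which a joint probability that factorises contains no correlations.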

The concept of mutual information is so general that 
it can easily be extended to quantum systems [2]. This 
leads us to the notion of quantum mutual information, 
which, for a general state $\sigma_{AB}$, is defined as $I(\sigma_{AB}) =
S(\sigma_A) + S(\sigma_B) - S(\sigma_{AB})$, where $S(\rho) = -\mathrm{tr}\,\rho \log \rho$ is
the von Neumann entropy and $\sigma_A$ and $\sigma_B$ are the reduced
density matrices of state $\sigma_{AB}$. However, in quantum
mechanics, we have learnt to discriminate between
different forms of correlations, a distinction that has 
no counterpart in classical information theory. First of 
all there is entanglement. Given a bipartite quantum
state $\sigma_{AB}$, entanglement represents any form of correlation
that cannot be captured by the states of the form
$\sum_i p_i \rho_A^i \otimes \rho_B^i$ (which are known as separable or disentangled).
Entanglement in $\sigma_{AB}$ is then most easily quantified
by calculating how different this state is from any separable
state [1, 0|. This difference can be expressed in a
number of ways, but the related details will not trouble
us at present (see, for instance, 0]). Among the
separable states, however, there are those that we can
call classically correlated. This will simply mean that
we have orthogonal states for subsystem A, call them
$|k\rangle$, and orthogonal states for subsystem B, $|l\rangle$, and the
probabilities corresponding to them, $p_{kl}$, will not simply
be equal to $p_k p_l$. Classically correlated states
would therefore have the general form $\sum_{kl} p_{kl} |k\rangle\langle k| \otimes |l\rangle\langle l|$.
There are clearly separable states that are not just classically
correlated in this very sense. One example is the
state $\rho_{AB} = \frac{1}{2}(|0\rangle\langle 0|_A \otimes |0\rangle\langle 0|_B + |1\rangle\langle 1|_A \otimes |+\rangle\langle +|_B)$,
where $|+\rangle = (|0\rangle + |1\rangle)/\sqrt{2}$. It will become transparent
later why we need to discriminate between separable
states and classically correlated states. We can also use
some entropic measure to quantify how different separable
states are from classically correlated ones and this too
will be discussed shortly. The states containing no correlations,
either quantum or classical, are called product
states, $\rho_A \otimes \rho_B$.

An equivalent way to Shannon's of quantifying corre- 
lations is to think of the reduction in entropy of A(B) 
when B(A) is measured. The more correlated A and B, 
the more we can learn about one of them by measuring 
the other. Suppose we make measurements on A. For 
each measurement outcome i, occurring with probability
$p_i$, the state of B will collapse to $\rho_B^i$. Classical correlations
in a state $\rho_{AB}$ are then simply the maximum
over all measurements performed on A of the quantity
$C(\rho_{AB}) = S(\rho_B) - \sum_i p_i S(\rho_B^i)$ (as defined in 0; see
also [3]). We can also define this quantity by swapping
the roles of A and B, but the subtleties related to the
question of symmetry of classical correlations will not be
relevant for our present discussion.

For classically correlated states it is clear that their 
quantum mutual information and classical correlations 
lead to exactly the same measure of correlations. What 
is rather intriguing, however, is that for separable states 
the mutual information is generally larger than classical 
correlations. This means that separable states contain 
correlations over and above just the classical ones. The 
discrepancy between the two is known as the quantum 
discord, $D = I - C$. We will call discord the correlations
over and above classical, but excluding entanglement.
(Note: in [8] the discord is defined to contain
entanglement as well).
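
To make the gap between mutual information and classical correlations concrete, here is a minimal numerical sketch for the separable two-qubit state $\frac{1}{2}(|0\rangle\langle 0| \otimes |0\rangle\langle 0| + |1\rangle\langle 1| \otimes |+\rangle\langle +|)$ mentioned earlier. It rests on several assumptions that are not part of the text: the maximisation is restricted to projective measurements sampled on a grid over the Bloch sphere (so the classical correlation is estimated from below), entropies are in bits, and all function names are illustrative.

```python
import numpy as np

def entropy(rho):
    """Von Neumann entropy in bits from the eigenvalues of a density matrix."""
    w = np.linalg.eigvalsh(rho)
    w = w[w > 1e-12]
    return float(-np.sum(w * np.log2(w)))

def reduced(rho, keep):
    """Reduced state of a two-qubit density matrix (keep=0 for A, keep=1 for B)."""
    r = rho.reshape(2, 2, 2, 2)  # indices a, b, a', b'
    return np.einsum("abcb->ac", r) if keep == 0 else np.einsum("abad->bd", r)

def classical_correlation(rho, measured, steps=41):
    """Grid search over projective measurements on the `measured` qubit of
    S(rho_other) - sum_i p_i S(rho_other^i); a lower estimate of the maximum."""
    other = 1 - measured
    s_other = entropy(reduced(rho, other))
    best = 0.0
    for theta in np.linspace(0, np.pi, steps):
        for phi in np.linspace(0, 2 * np.pi, steps):
            v = np.array([np.cos(theta / 2), np.exp(1j * phi) * np.sin(theta / 2)])
            p_up = np.outer(v, v.conj())
            average = 0.0
            for proj in (p_up, np.eye(2) - p_up):
                full = np.kron(proj, np.eye(2)) if measured == 0 else np.kron(np.eye(2), proj)
                post = full @ rho @ full          # unnormalised post-measurement state
                prob = np.trace(post).real
                if prob > 1e-12:
                    average += prob * entropy(reduced(post / prob, other))
            best = max(best, s_other - average)
    return best

ket0, ket1 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
plus = (ket0 + ket1) / np.sqrt(2)
projector = lambda v: np.outer(v, v.conj())
rho_ab = 0.5 * (np.kron(projector(ket0), projector(ket0))
                + np.kron(projector(ket1), projector(plus)))

mutual = entropy(reduced(rho_ab, 0)) + entropy(reduced(rho_ab, 1)) - entropy(rho_ab)
discord_measuring_a = mutual - classical_correlation(rho_ab, measured=0)
discord_measuring_b = mutual - classical_correlation(rho_ab, measured=1)
```

For this particular state the A side is encoded in orthogonal states, so measuring A recovers all of the mutual information and the difference $I - C$ vanishes; measuring B, whose states $|0\rangle$ and $|+\rangle$ are non-orthogonal, leaves a strictly positive difference. This is one face of the asymmetry between the two definitions of classical correlations.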

The general picture is this. Quantum mutual information
in any quantum state $\sigma_{AB}$ can be written as
$I = E + C + D$, where E is the amount of entanglement
in the state (as measured by the relative entropy of entanglement
[3] to make it on an equal footing with other
entropic measures of correlations). Physically this means
that the quantum mutual information measures total correlation
in a quantum state, which can be thought of as
consisting of entanglement, E, classical correlations, C,
and the additional quantum correlations, D, which are
not due to entanglement. For pure states, discord always
vanishes and total correlations are conveniently equal to
the sum of entanglement and classical correlations @.
Moreover, both entanglement and classical correlations
in this case are equal to one another and to either of the
reduced von Neumann entropies. This is an expected
consequence of the Schmidt decomposition of pure bipartite
states 0. For mixed states, discord is generally
non-vanishing, and this seems to hold important implications
for quantum information processing, the topic of
main focus in the present paper.



II. INFORMATION PROCESSING AND 
DISTINGUISHABILITY 

To motivate the forthcoming discussion we first ask 
the question: what feature of quantum mechanics makes 
quantum information processing more efficient than clas- 
sical? It has frequently been said that entanglement is 
clearly that feature. The answer seems obvious in the 
case of pure states. If there is no entanglement (or very 
little of it) during the evolution of pure states, then that 
evolution can efficiently (with only a polynomial overhead)
be simulated by classical systems [9]. But, we
should remember that according to our above discussion,
pure states contain the same amount of classical correlations
as entanglement. Therefore, we might well say that
it is classical correlations in pure states that are responsible
for the speed-up! The picture, however, changes
dramatically for mixed states. First of all, any evolution
of just classically correlated states can be simulated by
classical computers (by definition, for these states define
what we mean by classical computers); therefore clas- 
sical correlations cannot be, on their own, responsible 
for the speed-up. Furthermore, it is possible to have a 
speed-up with just separable (more than classically cor- 
related) states and this means that entanglement cannot 
be responsible for the speed-up either. If we look among 
correlations for the culprit, then we are only left with 
the discord, which is non-zero for mixed separable, but 
non-classically correlated states. This conclusion, how- 
ever, would not be consistent with the pure state anal- 
ysis, where discord is non-existent. We are finally left 
in an uncomfortable position: none of the correlations, 
quantum or classical, can (singularly) be responsible for 
the speed-up of quantum information processing! 

Besides correlations, we have the concept of indistin- 
guishability in quantum mechanics, namely the fact that 
different quantum states, unlike classical, need not, even 
in principle, be distinguishable from one another. This 
fact is key in quantum communications in general, and 
quantum cryptography in particular. The fact that Alice
encodes two messages, 0 and 1, into non-orthogonal
quantum states $|0\rangle$ and $|+\rangle$ makes it impossible for any
eavesdropper to remain unnoticed. Information cannot
be extracted from non-orthogonal states without disturbing
them. In fact, quantum computation can also be
viewed as a form of information processing where different,
in general non-orthogonal, outputs have to be discriminated
from each other (see [10] for an exposition of
this view). This is why we might expect that separable 
states with non-zero discord could still be more efficient 
than just classically correlated states. 

To illustrate how discrimination enters computation, 
let us look at the concrete example of the Deutsch-Jozsa
algorithm. This particular problem admits an exponential
speed-up over classical algorithms. Here, we are
promised a function which is either constant (all outputs
are 0 or all are 1) or balanced (outputs contain an equal
number of zeros and ones). If we are restricted to a single
application of the function, it is clear that classically we
cannot obtain any information. Knowing the value of a
single bit out of N bits implies no knowledge of the remaining
$N - 1$ bits. Quantumly, however, we can think of the
evaluation of the function on x as the implementation
of the phase factor $e^{i\pi f(x)}$. Then, if we input the state
$|+\rangle \otimes |+\rangle \otimes \cdots |+\rangle$, where $|+\rangle = |0\rangle + |1\rangle$, the output will
either be $\pm|+\rangle \otimes |+\rangle \otimes \cdots |+\rangle$ if the function is constant,
or it will be one of the orthogonal states containing
the superposition of all states with half of the phase factors
negative. Thus the final measurement is a simple
orthogonal, projective measurement to discriminate the
two cases. It can be shown that here entanglement will
exist in general among the states resulting from the application
of a balanced function [12]. We will therefore
turn to mixed states in order to show that entanglement 
is not needed for the higher quantum efficiency. 
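
The orthogonality that the final measurement relies on can be checked directly. The sketch below is only an illustration under stated assumptions: the function is applied as the phase $(-1)^{f(x)} = e^{i\pi f(x)}$ to a normalised uniform superposition, and the two example functions as well as the helper name are hypothetical.

```python
from itertools import product

def overlap_with_input(f, n):
    """Overlap between the uniform superposition over n-bit strings and the
    state obtained by applying the phase (-1)**f(x) to every component."""
    total = sum((-1) ** f(x) for x in product((0, 1), repeat=n))
    return total / 2 ** n

n = 3
constant = lambda x: 1             # same output for every input
balanced = lambda x: x[0] ^ x[2]   # an illustrative balanced function

overlap_constant = overlap_with_input(constant, n)
overlap_balanced = overlap_with_input(balanced, n)
```

A constant function returns the input state up to a sign (overlap of modulus one), whereas a balanced function gives zero overlap, so a single projective measurement onto the input state separates the two cases.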
Imagine that the input state is a mixture of the maximally
mixed state on N qubits, with the corresponding
probability $1 - \epsilon$, and the pure state $|+\rangle \otimes |+\rangle \otimes \cdots |+\rangle$, with
the probability $\epsilon$. These states arise, for example, in
liquid state Nuclear Magnetic Resonance quantum information
processing, and we will henceforth refer to them
as pseudopure (I am being a bit cavalier with mathematics
here: the natural states in NMR are the thermal
Gibbs states, but they could, for all practical purposes,
be approximated well by pseudopure states. Pseudopure
states are mathematically easier to handle, which is their
chief appeal). Provided that $\epsilon < 1/(2^{2n-1} + 1)$, this state
will never become entangled under any unitary evolution
[HI (and hence any functional evaluation in the Deutsch-Jozsa
algorithm, for example) since it is sufficiently mixed
that a separable decomposition is always possible. However,
no matter how small $\epsilon$ is, we can show that some
non-zero information can still be obtained regarding the
nature of the function. This is because the two resulting (output)
mixed states, corresponding to the constant and
the balanced function respectively, can always be partly
discriminated. How much information can be obtained is
conveniently quantified by the Holevo bound. This is
the difference between the entropy of the mixture of
the two states and the average of the entropies of the
individual states.

Suppose that the probability with which we are given 
a balanced function is p (and so the probability for the 
constant function is 1 — p). Then the final state of the 
computer is 

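$\rho_f = p\,U_b\{(1-\epsilon)I + \epsilon|+\rangle\langle +|^{\otimes n}\}U_b^\dagger$ (1)
$\quad + (1-p)\,U_c\{(1-\epsilon)I + \epsilon|+\rangle\langle +|^{\otimes n}\}U_c^\dagger$, (2)

where $U_{b,c}$ are the unitary transformations implementing
the balanced and the constant function respectively. The
information we can now extract about the nature of the
function is

$I_{\mathrm{out}} = S(\rho_f) - p\,S(U_b\{(1-\epsilon)I + \epsilon|+\rangle\langle +|^{\otimes n}\}U_b^\dagger)$ (3)
$\quad - (1-p)\,S(U_c\{(1-\epsilon)I + \epsilon|+\rangle\langle +|^{\otimes n}\}U_c^\dagger)$. (4)

We can show that in the limit of small $\epsilon$, which is what we
require to make the state always separable, this quantity
scales as $I_{\mathrm{out}} \approx 2^n\epsilon^2 + O(\epsilon^3)$ Q.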
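
The quadratic scaling can be checked numerically without building any matrices, because the spectra of the states involved are known in closed form. The sketch below makes several assumptions that go beyond the text: the maximally mixed part is normalised as $I/2^n$, the two outputs are equally likely ($p = 1/2$), a single fixed balanced function is used so that the two pure parts are orthogonal, and entropies are in nats. Under these assumptions a lowest-order Taylor expansion gives the prefactor $1/4$ in front of $2^n\epsilon^2$; that constant is a property of this sketch, not a claim about the text's conventions.

```python
from math import log

def entropy_from_spectrum(spectrum):
    """Von Neumann entropy in nats from (eigenvalue, multiplicity) pairs."""
    return -sum(m * lam * log(lam) for lam, m in spectrum if lam > 0)

def holevo_two_orthogonal_outputs(n, eps):
    """Holevo quantity for an equal mixture (p = 1/2) of two pseudopure states
    (1 - eps) I / 2**n + eps |psi><psi| whose pure parts are orthogonal."""
    d = 2 ** n
    noise = (1 - eps) / d
    single = [(noise + eps, 1), (noise, d - 1)]        # spectrum of each output
    mixture = [(noise + eps / 2, 2), (noise, d - 2)]   # spectrum of the average
    return entropy_from_spectrum(mixture) - entropy_from_spectrum(single)

n, eps = 6, 1e-4
chi = holevo_two_orthogonal_outputs(n, eps)
leading_order = 2 ** n * eps ** 2 / 4   # lowest-order Taylor term, in nats
```

Comparing `chi` with `leading_order` for decreasing `eps` shows the ratio approaching one, as long as $2^n \epsilon$ stays well below one.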

Let us compare the output information gain to the correlations
of one qubit with the remaining $N - 1$ qubits. Classically
we can only measure one bit, and we said that
this would give us no information about the nature of
the function. This is because the state of one bit is in
no way correlated to whether we have applied $U_b$ or $U_c$.
Quantumly, however, although there is no entanglement
in the final state, we do have a finite discord. Again
in the limit of small $\epsilon$ the discord is calculated to be
$D \approx 2^n\epsilon^2 + O(\epsilon^3)$ (to be shown in the next section), and
is therefore directly related to the information we obtain,
$I_{\mathrm{out}}$.

It is important to stress that the relationship between
the information obtained and the discord only holds under the
assumption that $\epsilon$ is small (so that we can make a Taylor
expansion of various entropies to their lowest order). We
will see in the next section that this translates into $\epsilon \ll
2^{-n}$. This limit is in perfect accord with the fact that we
require $\epsilon < 1/(2^{2n-1} + 1)$ in order to guarantee separable
states. (At the other extreme, when we consider pure
states, we have already noted that the speed-up occurs
even though the discord is always zero).

The relationship between information obtained and 
discord for highly mixed states is not accidental. We 
now proceed to show its exact form for general promise 
type problems. 

III. DISTINGUISHABILITY AND DISCORD 

Suppose we start with a pseudopure mixed state of n
qubits, $\rho_n = (1-\epsilon)I/2^n + \epsilon|0\rangle\langle 0|^{\otimes n}$. Furthermore, imagine
that we are promised N different properties encoded
into unitaries $U_1, U_2, \ldots, U_N$, with the respective probabilities
$p_1, p_2, \ldots, p_N$. The amount of information we can obtain
at the end about different properties is, as we have
seen, given by $I_{\mathrm{out}} = S(\sum_i p_i \rho_i) - \sum_i p_i S(\rho_i)$, where
$\rho_i = U_i \rho_n U_i^\dagger$. This expression can be simplified since
$S(\rho_i)$ has the same value for all the outcomes (because
they are all unitarily related to the input state and
hence must have the same entropy as the input state).

Let us now look at the discord between the first qubit 
and the rest in the final state after one unitary has been 
applied. It is given by 

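$D = 1 - S(\rho_n) + S(\rho_{n-1})$. (5)

We can easily compute both $S(\rho_n)$ and $S(\rho_{n-1})$,

$S(\rho_n) = -\left(\frac{1-\epsilon}{2^n} + \epsilon\right)\log\left(\frac{1-\epsilon}{2^n} + \epsilon\right)$ (6)
$\quad - (2^n - 1) \times \frac{1-\epsilon}{2^n}\log\left(\frac{1-\epsilon}{2^n}\right)$ (7)
$\quad \approx n - 2^n\epsilon^2$. (8)

By the same token $S(\rho_{n-1}) \approx n - 1 - 2^{n-1}\epsilon^2$. The discord
is now, to the lowest order in $\epsilon$, equal to

$D \approx 2^{n-1}\epsilon^2$. (9)

The mutual information, on the other hand, is calculated
to be

$I_{\mathrm{out}} = S(\sum_i p_i \rho_i) - S(\rho_i) \le n - S(\rho_i)$ (10)
$\quad \approx n - (n - 2^n\epsilon^2) = 2^n\epsilon^2 = 2D$. (11)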

Therefore, here we have a general inequality that the 
amount of information we can extract, which tells us 
about the efficiency of our quantum information process- 
ing, is bounded by (twice) the discord. This immediately 
shows that if the discord is zero, then no information can 
be obtained within this framework. 
In computing the discord we have assumed that the
final pure state in the mixture is of the form $|0\rangle \otimes |\Psi_0\rangle +
|1\rangle \otimes |\Psi_1\rangle$. The states $|\Psi_0\rangle$ and $|\Psi_1\rangle$ need not be orthogonal
in general, though in the Deutsch-Jozsa algorithm
they certainly are in the case of the balanced function 
(this is true when three and more qubits are concerned; 
for two qubits there is no entanglement present anywhere, 
at any stage of computation). 

Note, again, that this is in no contradiction with the 
fact that pure states have zero discord and yet lead to 
a quantum advantage. The reason is that we were assuming
here that $\epsilon$ is so small (because we insisted on
separability of all states involved) that we could make an 
approximation to both the discord as well as the informa- 
tion gain. For nearly pure states this approximation, of 
course, fails, and the discord is no longer an appropriate 
upper bound. 

Can anything be said about more general algorithms? 
We have so far only required a better than classical ef- 
ficiency. What happens if we are a bit more demanding 
and ask for a significant difference in efficiency? 

IV. DISCUSSION 

We can now raise the bar and ask for quantum pro- 
tocols that are not just more efficient than classical, but 
exponentially so (though at the end of the section we will 
see that even a polynomial speedup can in some cases 
be addressed by similar means). Exponential efficiency 
means that the time it takes to reach the answer scales 
quantumly as a polynomial of the number of (qu)bits re- 
quired for memory, while it takes an exponential time 
for any classical computer (Note: it is important that 
we keep the memory polynomially bounded). We ask if 
entanglement is needed in this case. 

We can answer this question in the affirmative in one
special case [15]. Suppose that our initial state is a pseudopure
state. If the pure fraction is $\epsilon$, then the number
of times we need to repeat the computation to obtain
a correct result is of the order of $1/\epsilon$. If we assume,
like in Deutsch-Jozsa, that we are computing some (non-constant)
function f, then the pure part of the pseudopure
mixture will generally evolve to be $\sum_x |x\rangle \otimes |f(x)\rangle$
(this is also true for Grover's [16] and Shor's [17] algorithms).
Let us now look at the entanglement between
the first and the second register in the pseudopure mixture.
We project the first register onto the $|0\rangle$, $|1\rangle$ subspace
(without destroying the coherence between the two
states). Then, we obtain an effective two qubit pseudopure
state

$\rho_{2\times 2} = (1-\delta)\frac{I}{4} + \delta\,|\Psi\rangle\langle\Psi|$, with $|\Psi\rangle = (|0\rangle|f(0)\rangle + |1\rangle|f(1)\rangle)/\sqrt{2}$, (12)

where $\delta = \epsilon/((1-\epsilon)2^{-n_2+1} + \epsilon)$ and $n_2$ is the number
of qubits in the second register (roughly equal to
n/2 in general). Since this is a two qubit state, it is
entangled if and only if $\delta > 1/3$; avoiding entanglement therefore requires
$\epsilon < 1/(2^{n_2} + 1) \approx 1/2^{n/2}$. If $\epsilon$ is not in this domain then
the original n qubit pseudopure state must have been entangled
(since we performed a local projective measurement
and this, by definition, cannot create entanglement
out of a separable state). To avoid entanglement, therefore,
the pure fraction must not be greater than $\epsilon \approx 2^{-n/2}$
and this means that the resulting computation cannot be
exponentially more efficient than classical.
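
The threshold at one third can be verified with the positive partial transpose test, which is necessary and sufficient for separability of two-qubit states. The following sketch assumes an effective state of the form $(1-\delta)I/4 + \delta|\Psi\rangle\langle\Psi|$ with $|\Psi\rangle = (|00\rangle + |11\rangle)/\sqrt{2}$, i.e. the illustrative choice $f(0) = 0$, $f(1) = 1$, and an effective pure fraction $\delta = \epsilon/((1-\epsilon)2^{1-n_2} + \epsilon)$; the function names are hypothetical.

```python
import numpy as np

def min_eig_partial_transpose(rho):
    """Smallest eigenvalue of the partial transpose (on the second qubit)
    of a two-qubit density matrix; a negative value signals entanglement."""
    r = rho.reshape(2, 2, 2, 2)                  # indices a, b, a', b'
    pt = r.transpose(0, 3, 2, 1).reshape(4, 4)   # swap b <-> b'
    return float(np.linalg.eigvalsh(pt).min())

def pseudopure_two_qubit(delta):
    """(1 - delta) I/4 + delta |Psi><Psi| with |Psi> = (|00> + |11>)/sqrt(2)."""
    psi = np.array([1.0, 0.0, 0.0, 1.0]) / np.sqrt(2)
    return (1 - delta) * np.eye(4) / 4 + delta * np.outer(psi, psi)

def delta_from_epsilon(eps, n2):
    """Effective pure fraction after projecting the first register (assumed form)."""
    return eps / ((1 - eps) * 2.0 ** (1 - n2) + eps)

below = min_eig_partial_transpose(pseudopure_two_qubit(0.30))   # separable side
above = min_eig_partial_transpose(pseudopure_two_qubit(0.36))   # entangled side

n2 = 5
delta_at_threshold = delta_from_epsilon(1 / (2 ** n2 + 1), n2)  # expected: 1/3
```

The smallest eigenvalue of the partial transpose changes sign as $\delta$ crosses $1/3$, and the value $\epsilon = 1/(2^{n_2} + 1)$ maps exactly onto that point.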

The whole above discussion can conveniently be 
phrased in terms of distinguishability. If your input state 
is too distinguishable from the pure state that would yield 
the maximum quantum efficiency, then there is no gain 
in using it for quantum computing. In this case we can 
use the relative entropy to quantify this distinguishability 
[18]. This is an asymptotic measure that is only achieved
in the limit of a large number of trials, but it otherwise provides
an upper bound for any finite case scenario. Supposing
again that we work with a pseudopure state, this
leads to:

$S\left(|\Psi\rangle\langle\Psi| \,\Big\|\, \epsilon|\Psi\rangle\langle\Psi| + (1-\epsilon)\frac{I}{2^n}\right) = -\log\left(\epsilon + \frac{1-\epsilon}{2^n}\right) \approx \log\frac{1}{\epsilon}$. (13)

The probability that the pseudopure state will be confused
with $|\Psi\rangle\langle\Psi|$, which means that we have a successful
outcome, is simply given by the exponential of the above
relative entropy [18]:

$\exp\left\{-S\left(|\Psi\rangle\langle\Psi| \,\Big\|\, \epsilon|\Psi\rangle\langle\Psi| + (1-\epsilon)\frac{I}{2^n}\right)\right\} = \epsilon$.

This is the same conclusion as above, namely we need
to repeat the computation roughly $1/\epsilon$ times to have a
unit success. If, furthermore, we require the pseudopure
state to be distinguishable from a separable state, then
$\epsilon < 1/2^n$ and hence there is no exponential speed up in
the resulting computation.
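
As a quick numerical check of the repetition count, the sketch below evaluates the relative entropy between a pure state and the pseudopure state built on it. It uses the fact that the pure state is an eigenvector of the pseudopure state, so only the eigenvalue $\epsilon + (1-\epsilon)/2^n$ enters; natural logarithms and the function name are assumptions of this illustration.

```python
from math import exp, log

def relative_entropy_pure_vs_pseudopure(eps, n):
    """S(|Psi><Psi| || eps |Psi><Psi| + (1 - eps) I / 2**n) in nats."""
    return -log(eps + (1 - eps) / 2 ** n)

n = 20
results = {}
for eps in (1e-1, 1e-2, 1e-3):
    s = relative_entropy_pure_vs_pseudopure(eps, n)
    confusion = exp(-s)                      # probability of a successful outcome
    results[eps] = (confusion, 1 / confusion)  # (success probability, repetitions)
```

For $n = 20$ the success probability agrees with $\epsilon$ to within about $10^{-6}$, so the number of repetitions is indeed of the order of $1/\epsilon$ whenever $\epsilon \gg 2^{-n}$.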

This argument is very simple, but what does it mean? 
Does it mean that, for example, Shor's algorithm def- 
initely uses entanglement? If our input is pseudopure, 
then the answer is yes. However, if we use a mixture 
of another type the answer remains unknown (although 
there is numerical evidence that entanglement always appears
[19]). What is more, we have evidence that different
mixtures can achieve exponential speed-up without
entanglement (or with very little of it) in another instance.
This instance is, in fact, also another clear example
of the connection between discord and distinguishability
and lies at the heart of the efficiency of quantum
computation. The task is to compute the trace of a unitary
matrix and can be accomplished with one non-maximally
mixed qubit and a completely mixed register of n qubits
H|. Here the input state is of the form $|+\rangle\langle +| \otimes I/2^n$
(note: this is not a pseudopure state). The said unitary
is applied only if the first qubit is in the state $|1\rangle\langle 1|$. This
leads to the state

$\rho_U = (|0\rangle\langle 0| + |1\rangle\langle 1|) \otimes \frac{I}{2^n} + |0\rangle\langle 1| \otimes U^\dagger + |1\rangle\langle 0| \otimes U$ (14)
(The trace of U is now extracted by measuring the first
qubit in the $|\pm\rangle$ basis). It is clear that here there is no
entanglement between the first qubit and the mixed register.
Furthermore, we can make the first qubit arbitrarily
close to the mixed state (where mixedness is constant,
i.e. independent of the number of qubits) and still have
an exponential quantum advantage [21]. It is tempting
therefore to look for reasons other than entanglement to
explain the speedup. Discord is certainly one such measure
(as proposed in (HI), but this, again, is related to the
distinguishability between the states of the mixed register
resulting from measuring the qubit in the $|\pm\rangle$ basis.
Therefore, here a similar link exists between speedup,
discord and distinguishability to the one we found in the
Deutsch-Jozsa algorithm. Note that (in line with what
has been said) there is evidence that this algorithm is
difficult to simulate classically, even though the overall
entanglement scales poorly (or is non-existent) [22].
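
The trace estimation protocol described above can be simulated for a small register. The sketch below is an illustration under stated assumptions: the state is built with its full normalisation, the unitary is a random one drawn via a QR decomposition (any unitary would do), and exact expectation values stand in for the statistics of repeated measurements. Measuring the first qubit along X (the $|\pm\rangle$ basis) gives the real part of the normalised trace; measuring along Y gives the imaginary part.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 3
d = 2 ** n

# A random unitary from the QR decomposition of a complex Gaussian matrix
# (an illustrative choice, not prescribed by the text).
z = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
U, _ = np.linalg.qr(z)

plus = np.array([1.0, 1.0]) / np.sqrt(2)
rho_in = np.kron(np.outer(plus, plus), np.eye(d) / d)   # |+><+| (x) I/2^n

# Controlled-U: apply U to the register only when the first qubit is |1>.
controlled_u = (np.kron(np.diag([1.0, 0.0]), np.eye(d))
                + np.kron(np.diag([0.0, 1.0]), U))
rho_u = controlled_u @ rho_in @ controlled_u.conj().T

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]])
exp_x = np.trace(rho_u @ np.kron(X, np.eye(d))).real    # Re tr(U) / 2^n
exp_y = np.trace(rho_u @ np.kron(Y, np.eye(d))).real    # Im tr(U) / 2^n
estimate = exp_x + 1j * exp_y
```

In an actual experiment the two expectation values would be estimated from repeated runs, with a statistical error that does not depend on the size of the register.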

Grover's algorithm [16] is interesting to mention here
simply because the speedup is only polynomial, i.e.
quadratic, but the algorithm is of a very general nature
(since any difficult problem boils down to a search).
It has been established that pure state Grover's algorithm
in general contains entanglement between qubits
[12]. There have been claims that search can be done
without entanglement, but this is only true, so far as we
can tell, when the memory encoding is inefficient. For
example, a "classical laser beam" (meaning: a coherent
state with a large average number of photons) can perform
Grover's search using a diffraction grating encoding
a database to be searched [23]. Each slit in the grating
here represents a different database element. It is
clear that this is less efficient than using (qu)bits to encode
the database since we only need $\log N$ qubits to
encode N database elements whereas we need N slits in
the diffraction grating (i.e. exponentially more spatial 
resources) to do the same. Here entanglement is thus 
linked with efficient spatial encoding. 

What happens if we use the pseudopure states to run
Grover's algorithm? It has been shown that if $\epsilon >
1/\log N$, then Grover's search algorithm is still more efficient
than any classical algorithm [24] (this immediately
follows from the fact that the probability of success is
reduced by $\epsilon$ when we have pseudopure states. If $\epsilon$ scales
as an inverse polynomial of the number of qubits $n = \log N$, then
we only need polynomially many repetitions to achieve
Grover's quadratic speedup. The overall quantum time
is therefore $\approx \sqrt{N}\log N \approx \sqrt{N}$, up to logarithmic factors). However, as we have
seen, these states are, in fact, entangled. The reason is
that all states of the form in eq. (12) also occur during
Grover's algorithm (in Grover's search $f(0) = 0$, and
this represents all irrelevant database elements, while
$f(1) = 1$ corresponds to the database element we are
looking for; at some stage of the algorithm the two will
have comparable amplitudes, which is all we need in this
argument). Here, therefore, as soon as we require anything
faster than classical (which scales as N) we immediately
have entangled pseudopure states. We cannot, of
course, rule out that there are some other mixed states
that are separable and yet achieve a quadratic speedup.
However, for states where some qubits are pure and others
are maximally mixed (as in the case of the power of a
single qubit), evidence points to the necessity for entanglement
im.

A general point deserves special attention in our dis- 
cussion. It appears that a strong criterion for quantum 
effectiveness is the fact that the state throughout the 
computation is sufficiently different from a classically correlated
state (though this need not mean that it is entangled
at any stage). We have seen that the relative
entropy conveniently tells us about how distinguishable
two states are. Suppose, therefore, that we ask a less
demanding, but related, question: how distinguishable is
a maximally noisy state from a given pseudopure state?
This can be computed to be:

$S\left(\epsilon|\Psi\rangle\langle\Psi| + (1-\epsilon)\frac{I}{2^n} \,\Big\|\, \rho_c\right) =$ (15)
$-S\left(\epsilon|\Psi\rangle\langle\Psi| + (1-\epsilon)\frac{I}{2^n}\right) -$ (16)
$\mathrm{tr}\left\{\left(\epsilon|\Psi\rangle\langle\Psi| + (1-\epsilon)\frac{I}{2^n}\right)\log\rho_c\right\} =$ (17)
$2^n\epsilon^2 + O(\epsilon^3)$ (18)

(again assuming a small $\epsilon$ expansion). Let us now compute
this same quantity, but for the power of a single
qubit state and see if and by how much more this state is
distinguishable from a maximally noisy one. The relative
entropy is now given by:

$S\left(U\left\{|0\rangle\langle 0| \otimes \frac{I}{2^n}\right\}U^\dagger \,\Big\|\, \rho_c\right) = 1$. (19)

It is clear that the single pure qubit state is much
more distinguishable (exponentially more so!) from pure
noise than the pseudopure state (bearing in mind that
$\epsilon \approx 2^{-n}$). This direction will require much further research,
but can we say that this kind of distinguishability
is at the root of the efficiency of some states and evolu- 
tions as opposed to others? The mathematical intricacy 
will lie in, firstly finding the best classically correlated 
state to approximate our quantum state (something that 
I conveniently avoided doing above) and, secondly, doing 
this for each instance in time as the state evolves in a 
unitary fashion. 
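
The comparison between the two kinds of states can be reproduced with a few lines of arithmetic, since the relative entropy to the maximally mixed state on m qubits equals m minus the entropy of the state. The sketch below works in bits and uses the known spectra of both states; taking the maximally noisy state to be $I/2^m$, the choice $\epsilon = 2^{-n}/10$, and the function names are assumptions of this illustration.

```python
from math import log2

def entropy_bits(spectrum):
    """Entropy in bits from (eigenvalue, multiplicity) pairs."""
    return -sum(m * lam * log2(lam) for lam, m in spectrum if lam > 0)

def distance_from_noise_pseudopure(n, eps):
    """S(rho || I/2**n) = n - S(rho) for rho = eps |Psi><Psi| + (1 - eps) I/2**n."""
    d = 2 ** n
    noise = (1 - eps) / d
    return n - entropy_bits([(noise + eps, 1), (noise, d - 1)])

def distance_from_noise_one_pure_qubit(n):
    """Same quantity for |0><0| (x) I/2**n on n + 1 qubits; a unitary applied
    to the state leaves the value unchanged."""
    d = 2 ** n
    return (n + 1) - entropy_bits([(1 / d, d)])

n = 10
eps = 2.0 ** (-n) / 10
pseudopure = distance_from_noise_pseudopure(n, eps)
one_qubit = distance_from_noise_one_pure_qubit(n)
```

With these numbers the single pure qubit sits one full bit away from pure noise, whereas the pseudopure state is several orders of magnitude closer to it.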

V. DIGRESSION: CLUSTER STATES 

There exists a computational model exempt from the
above considerations in that entanglement is definitely
a necessary resource for it. Here I have in mind the so-called
cluster state quantum computation (or measurement
based quantum computation) [25]. This form of
computation consists, in fact, of a sequence of one qubit
measurements followed by a feed-forward of the information
contained in the measurement outcomes. Measurements
are performed on highly entangled initial
states of many qubits, the so-called cluster states. Typically,
parts of clusters are measured so that (as a consequence)
the remaining, unmeasured, qubits undergo a
desired computation. The feed-forward of classical infor- 
mation ensures that the evolution of the remaining part 
is unitary (deterministic) in spite of being driven by mea- 
surements. 

Cluster state computation is, simply speaking, a gen- 
eralization of the teleportation protocol to other, more 
complicated, algorithms. Just like in teleportation, en- 
tanglement is crucial for cluster state computation. A 
cluster state that is only separable cannot achieve any 
advantage over classical computers. However, and this 
is at first sight a surprising fact, too much entanglement 
in clusters can also be detrimental to quantum computation
[26]. This is because a highly entangled state,
when measured, tends to give an output that is indistinguishable
from a maximally mixed state (note: here the
distinguishability is between the output bit strings made
up of measurement outcomes and not between the states
themselves, though the two are, of course, somewhat related).
We now proceed to explain what it means to be
highly entangled.

We can phrase the success probability for computation
in terms of relative entropy in the following way (for a
more detailed and rigorous analysis see [26]):

1. Consider the sequence of bits $b = (b_1, b_2, \ldots, b_N)$ generated
by making one qubit measurements on the
entangled state used for measurement based computing.

2. The probability of getting a particular string is
$p(b) = |\langle b_1, b_2, \ldots, b_N|\Psi\rangle|^2$, where $|\Psi\rangle$ is the entangled
state itself.

3. Let the number of bit strings giving us a non-zero
probability of success be $N_s$.

4. Then the total probability of success can be estimated
to be

$p_s = \sum_{i \in N_s} |\langle b_1, b_2, \ldots, b_N|\Psi\rangle|^2 \le \sum_{i \in N_s} 2^{-E_g} = N_s 2^{-E_g}$,

where $E_g$ is (twice the log of) the geometric measure
of entanglement [27] (this itself is a lower
bound on the relative entropy of entanglement, but
for cluster states the two coincide).

Computation should only proceed if the probability of
success is finite, $p_s = c > 0$, so that the correct result can
be achieved by repetition (c is a constant independent of
the number of qubits N). This implies that

$N_s \ge 2^{E_g - \log(1/c)}$.

Now, if entanglement scales as the number of qubits N
(for large N, to be precise, entanglement in most states
scales as $N - \log N$, but this logarithmic correction is
immaterial in the thermodynamical limit) then it follows
that the number of successful solutions to our problem is
equal to the total number of outputs (minus a constant
factor $\log(1/c)$ which, again, is irrelevant in the large N
limit). Therefore, the computation using such an entangled
state can be simulated using a completely random coin
toss. It turns out that cluster states have exactly $N/2$
units of entanglement which clearly does not lead to a
trivial result from the above inequality (any reader interested
in entanglement scaling in various many-body
states could consult the elementary review in [28]). Anything
much smaller than this (e.g. $\log N$) would be insufficient
as a universal resource, but for a different reason:
this state, and measurements made on it, could be simulated
by classical means.
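
The counting argument can be made tangible with a little arithmetic. The sketch below evaluates the lower bound on the fraction of all $2^N$ output strings that must be successful, as implied by the requirement that $N_s$ be at least $c\,2^{E_g}$. Logarithms are taken to base 2, and the values $N = 100$ and $c = 1/2$ as well as the function name are purely illustrative assumptions.

```python
from math import log2

def log2_min_success_fraction(n_qubits, e_g, c):
    """log2 of the lower bound on N_s / 2**N implied by N_s >= c * 2**E_g."""
    return e_g - log2(1 / c) - n_qubits

N, c = 100, 0.5
generic = log2_min_success_fraction(N, N - log2(N), c)   # E_g ~ N - log N
cluster = log2_min_success_fraction(N, N / 2, c)         # E_g = N / 2
```

For a generic highly entangled state at least a fraction of about $c/N$ of all strings must count as successful, so guessing at random already works with a probability that is only polynomially small. For a cluster state the bound drops to roughly $c\,2^{-N/2}$, which places no such constraint.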

It is interesting to note that entanglement in cluster
states behaves very much like the free energy or entropy
(hence too much entanglement in a state is akin to too
high an entropy of the resulting computation). Suppose
that we have a mixed state due to the system being at
a finite temperature, T. The probability that we are in
the ground state is then $p_G = e^{(-E_G + F)/kT}$, where $E_G$ is
the ground state energy and $F = -kT \log Z$ is the
free energy (Z being the partition function). Let us, for
simplicity, assume that $E_G = 0$ and that $kT = 1$. The
estimate for the number of successful bit strings then
becomes

$$N_s \ge 2^{E_g - F - \log(1/c)}$$

(since $p_s \le N_s\, p_G\, 2^{-E_g} = N_s 2^{-E_g + F}$). This shows that
if $E_g - F \approx N$, the state is useless for quantum
computation. There is here obviously a tradeoff between free
energy (or entropy) and entanglement. Too much
entanglement simply implies too low a free energy (or too
high an entropy), which has the "effect" of generating too
much noise in the output. This is (thermodynamically
speaking) why the output then becomes indistinguishable
from a random state. In some sense, performing
cluster state quantum computation is analogous to doing
useful work, which is only possible if the state has
non-zero free energy, i.e. if it is sufficiently different from a
maximally mixed state (for a more in-depth discussion of
analogies between clusters and thermodynamics see [29]).
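
The thermal estimate above can be unpacked in one line. The sketch below
assumes, as the text does, $E_G = 0$ and $kT = 1$, and additionally assumes
that exponentials and logarithms are taken to the same base, so that the
ground-state probability can be written $p_G = 2^{F}$; this base convention
is our assumption and is not stated explicitly in the text.

```latex
\begin{align}
  c = p_s &\le N_s \, p_G \, 2^{-E_g} = N_s \, 2^{-E_g + F} \\
  \Rightarrow \quad N_s &\ge c \, 2^{E_g - F} = 2^{E_g - F - \log(1/c)} .
\end{align}
```

Since $F = -\log Z \le 0$ when $E_G = 0$ (the partition function then
satisfies $Z \ge 1$), the term $-F$ adds to $E_g$ in the exponent, so a
lower free energy has the same effect on the bound as more entanglement.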

The bottom line, ultimately, is this. The fact that
entanglement is needed for cluster states does not mean
that we cannot achieve significant speedups without it,
simply because cluster states are just one way of execut- 
ing quantum computation. Though clusters are a univer- 
sal resource, entanglement in them is really a substitute 
for the missing unitary dynamics (given that only sin- 
gle qubit measurements are at our disposal). So, even 
here, it is not clear which resource is responsible for the 
quantum effectiveness. The focus would have to shift to 
measurements and the effect of noise on them as well as 
the ensuing computation. 
VI. CONCLUSION AND OUTLOOK

In addition to the search for the source of power of 
quantum information processing, there is a related phys- 
ical issue about the relationship between the concepts of 
superposition and entanglement. It is, in fact, very dif- 
ficult to discuss the cause (or causes) for the quantum 
information speedup, without immediately running into 
some fundamental physical issues (information, after all, 
is physical). I will use a simple example related to the above
discussion to illustrate the point.

Suppose that the pure qubit in the example of "the
trace computation" (the power of a single qubit) is a
photon entering an interferometer. Then, after the first
beamsplitter, the photon is in a superposition
of two paths. However, this state can also be considered
an entangled state, as it is written as $|01\rangle + |10\rangle$.
In fact, we have shown elsewhere that this state can
violate Bell's inequalities [30] and is, therefore, a legitimate
(though single particle) entangled state on a par
with the state $|HH\rangle + |VV\rangle$ of two photons in, say,
parametric down conversion (H, V stand for horizontal
and vertical polarisation respectively). The subsequent
unitary operation conditional on the second mode being
$|1\rangle$ is then just a local unitary transformation applied
to the second mode and it must thus preserve
the original entanglement. In fact, even if we start
with a mixed state (as long as it is not completely
mixed), our resulting state will still always be entangled
between the two spatial modes (since it is just given
by the mixture $p|\Psi^+\rangle\langle\Psi^+| + (1-p)|\Psi^-\rangle\langle\Psi^-|$). This
state has the relative entropy of entanglement equal to
$E = 1 + p \log p + (1-p) \log(1-p)$.$^{3}$ Within such a single
pure photon implementation, entanglement is clearly
always present and could therefore be said to be responsible
for the speedup (as much as any classical or total
correlations are).
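
As a quick numerical illustration of the formula
$E = 1 + p \log p + (1-p) \log(1-p)$, the sketch below evaluates it for a
mixing probability p. The function names are hypothetical, and base-2
logarithms (entanglement measured in ebits) and the convention
$0 \log 0 = 0$ are assumptions made for this example.

```python
import math


def binary_entropy(p: float) -> float:
    """Shannon entropy H(p) in bits, using the convention 0*log(0) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)


def relative_entropy_of_entanglement(p: float) -> float:
    """E = 1 + p*log2(p) + (1-p)*log2(1-p) = 1 - H(p) for the two-term mixture."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability in [0, 1]")
    return 1.0 - binary_entropy(p)


if __name__ == "__main__":
    for p in (0.5, 0.6, 0.9, 1.0):  # illustrative mixing probabilities
        print(f"p = {p:.1f}: E = {relative_entropy_of_entanglement(p):.4f}")
```

The expression vanishes only at p = 1/2, the completely mixed case
excluded in the text, and is strictly positive for any other p.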

We are at the end of our search for the source of quantum
effectiveness and one conclusion can safely be drawn:
we should give up looking for a single reason behind
the quantum speedup. Most likely, the answer will intimately
be connected with the exact nature of the problem
and, as seen above, will vary from problem to problem.
Though possibly intellectually displeasing, this answer is
the only possible consistent one at present. This leads us
to the following final thought.

Let us at the end of our investigation take a broader 
view of information processing. Beyond man-made com- 
putational devices, there are, of course, much older and 
more ubiquitous information processors in nature - the 
living systems. All living systems are very opportunis- 
tic (possibly even more so than theoretical physicists) 
and what matters to their survival is to be able to gain 
even the smallest available advantage over their competi- 
tors. In natural information processing a fraction of a 
second speedup over and above one's predator, for in- 
stance, makes all the difference in the world. Nature 
could not care less about exponential improvements - it 
simply does not see beyond the next step. Any improve- 
ment that results in a higher chance of survival will sim- 
ply suffice (though in the long run, all the incremental 
steps may ultimately add up to an exponential improve- 
ment). Therefore, if we generalise our question and ask 
whether quantum physics could improve certain natural 
operations, it is to be expected that entanglement may 
no longer be the most important resource. All tricks of 
the quantum trade will then be exploited, very much in 
the spirit of the present discussion. 